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Abstr act. We investigate Fano schemes of conditionally generic intersections, 
i.e. of hypersurfaces in projective space chosen generically up to additional con- 
ditions. Via a correspondence between generic properties of algebraic varieties 
and events in probability spaces that occur with probability one, we use the 
obtained results on Fano schemes to solve a problem in machine learning. 



1. Introduction 

Properties of generic points of a variety, or generic members of a family of vari- 
eties, are often much easier to study than their non-generic counterparts. An obvi- 
ous reason why generic properties are simpler to study is that open sets in algebraic 
geometry are also dense, and so generic properties are topologically very natural 
ones to consider. One example of this phenomenon is the family of lines on a hyper- 
surface in projective space, i.e. the hypersurface's Fano scheme of lines: the dimen- 
sion, smoothness and connectedness of this Fano scheme for a generic hypersurface 
were determined in B VdV79 , whereas its dimension for arbitrary smooth hypersur- 
faces is still unknown except in certain cases |Col79| IHMP981 IBeh06| ILR09| ILT10] . 
The conjectured dimension in general is the content of the Debarre-de Jong con- 
jecture |Deb01j . This paper studies Fano schemes of fc-planes for intersections of 
hypersurfaces in P n that are chosen generically up to some additional property, 
which we call conditional genericity. Our main results characterize the dimension 
of these Fano schemes, with the chosen conditions coming from an application to 
machine learning. 

If generic properties are sometimes the only ones accessible in algebraic geometry, 
they are in a sense the only ones of importance in statistics and machine learning. 
To be more precise, given a probability space (ft, A, fx), an event E occurs almost 
surely if it occurs with probability one, that is, if (i(E) = 1. Suppose that fl is 
an algebraic variety, and fi is a continuous measure. If generic events in SI are 
taken to be (Zariski) open, measurable subsets, then non-generic events occur with 
probability zero. Roughly speaking, non-generic events may be ignored in statistics 
and machine learning because they never occur. 

Our application to machine learning involves stationary subspace analysis (SSA) 
vBMKM09 , a method for multivariate time series analysis that has been applied 
to brain-computer interface research. We frame the problem and its relation to 
Fano schemes in Section [3J but present here one simplified application from brain- 
computer interface research. Suppose our goal is to isolate a brain signal for a 
particular task from other signals that vary with time and are irrelevant to this 
task, e.g. we seek to separate one type of brain activity from other activity related 
to environmental factors or so-called alpha oscillations due to fatigue. The data is 
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divided into N epochs, with each epoch modeled as a random variable. In Section 
3, we develop a precise identifiability criterion for SSA — i.e. how many epochs arc 
necessary to identify the stationary signal — in terms of the dimension of a certain 
Fano scheme. 

We now describe the structure of the paper. Section [2] develops results on Fano 
scheme of various conditionally generic intersections. In Section 12.11 we study the 
Fano scheme of generic intersections of hypersurfaces under the assumption that all 
hypersurfaces contain some fixed A:-plane. Section |2~21 restricts to Fano schemes of 
intersections of quadric hypersurfaces, but conditions the genericity assumption on 
the quadrics both containing a fixed fc-plane and having a fixed rank. We conclude 
with Section[3l where we relate results of the previous sections to machine learning. 

Notation and conventions: All work below is carried out over the complex 
numbers, although the results hold for any algebraically closed field. Unless other- 
wise specified, dimension will refer to projective dimension; e.g. dim0 = — 1. For a 
linear subspace A C P" of dimension k, we write [A] for the corresponding element 
of the Grassmannian Gr(fc, n), and when considering a vector subspace S instead 
as a projective subpace, we write ¥S. 

For the vector space of homogeneous polynomials of degree d in n + 1 variables 
we write r ra+ i(<i). For a mulitidegree d — (di, . . . ,d s ), the product of the corre- 
sponding vector spaces of polynomials will be denoted by r n+ i(d). 

A brief note for the reader coming from outside of algebraic geometry: the word 
"scheme" can safely be replaced with "variety" in almost all appearances below. 
The second author has also prepared a companion set of notes to this paper for 
those with rudimentary knowledge of algebraic geometry [Larl2j . 

Acknowledgements: We would like to thank Bernd Sturmfels for introducing 
us to one another, for suggesting the connection to Fano schemes, and for his 
encouragement throughout. We are grateful to Fabian Miiller for his invaluable 
help at numerous junctures during this project. We would also like to thank Luke 
Oeding for his insights on symmetric tensors. Several of the results of this paper 
were first tested with Macaulay2 [GSlOj and MATLAB(R) /Octave. 

2. Fano schemes of intersections of conditionally generic 

hypersurfaces 

The kth Fano scheme of a projective variety A, denoted Fk(X), is the sub- 
scheme of the Grassmannian Gr(fc,n) parametrizing fc-planes contained in X. If 
f^\ . . . , are defining equations of A, with degf^ — di, then we write A = 
V(f), where f = (f^\ . . . , jn s '), and call d = (di, . . . , d s ) the multidegree of f . 

The Fano scheme Fk(V(i)) has been studied in [DM98 under the assumption 
that f e r n+ i(d) is chosen generically. Let n, k, s £ N and d = (di, . . . , d s ) € N s , 
with all di > 2, and set 



Theorem 2.1 (DM98 ). Let [f] = . . . , /„)] € Pr„+i(d) be generic, and sup- 
pose that d ^ (2). 

(1) If 5(n, d, fc) < 0, then the Fano scheme Fk(V(t)) is empty. 

(2) If 5{n,d,k) > 0, then the Fano scheme Fk(V(f)) is smooth of dimension 




5(n, d, k). 
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The case of a single quadric is also settled in |DM98j using different argumentation, 
but the Fano scheme of a single hypersurface carries little interest for our application 
to machine learning (see Section [3]) . Therefore we assume in the remainder that 
d^(2). 

We extend these results to conditionally generic [f] G Pr„+i(d) in two ways. 
First, in Section 12. 1\ we take the defining equations to be generic up to the as- 
sumption that each hypersurface V(f^) contains a fixed fc-plane A'. And second, 
in Section 12.21 we restrict to intersections of quadrics that are generic, conditional 
upon all quadrics having a fixed rank r and containing a fixed /c-plane. Our ap- 
plication to machine learning involves determining when, under the conditional 
genericity mentioned above, there is equality Fk(V(i)) = [A'], i.e. when identi- 
fiability is achieved (see Section [3] for details). Thus determining when the Fano 
scheme has dimension is especially important. 

2.1. Generic intersections containing a common subspace. Suppose first 
that S(n,d,k) > 0. The analogue of statement (2) in Theorem 12.11 for generic 
[f] G Pr n+ i(d) such that A' C V(f) is a straightforward corollary. 

Corollary 2.2. Let A' G Gr(fc,n) be generic, and let [f] G Pr„ + i(d) be generic 
such that A' C V(f). If 5(n, d, k) > 0, then dimF fe (V(f)) = d{n,d,k). 

Proof. Let S\/ (d) C r n+ i(d) be the subpace consisting of all s-tuples f such that 
A' C V(f). This is a codimension ( d £ fc ) subspace. Consider the incidence corre- 
spondence 

I = {([{}, [A]) : A C V(f)}C Pr n+1 (d) x Gr(fc,n) 




Pr n+1 (d) Gr(*,n). 

The proof of Theorem 12 . 1 1 involves showing that, under the conditions of part (2), p 
is dominant, and hence by the theorem of the dimension of the fiber, there exists an 
open set U C Pr n+ i(d) over which all fibers — which are Fano schemes Fk(V(i)) — 
have the expected dimension S(n, d, k). The corollary will follow after showing that 
for generic A' G Gr(fc, n), the intersection P^A'(d) n U is non-empty, and hence 
dense in PSA'(d). 

Set Gr = Gr(( d ^") - ( d + k ) - 1, (^J 1 ) - 1), so that [S A '(d)] G Gr. Note that p 
factors as follows: if J = {([f], [S A 'D : f € S A >(d)} C Pr n+ i(d) x Gr, then 

p-.i^i Apr„ +1 (d) 

But dominance of p implies that p _1 (C/) is dense in /, so for generic [A'] it follows 
that [A'] is in some fiber of p over U, and hence PS'A'(d) n U ^ 0, as required. □ 

Next we show that the remaining cases where 8(n, d, k) < result in identifiability. 

Theorem 2.3. Let [A'] G Gr(fc,n) be generic, and let [f] G Pr n+ i(d) be generic 
such that A' C V(f). IfS(n,d,k) < 0, then F k (V{f)) = [A']. 

Proof. For the incidence correspondence / above, the fibers of q are all subspaces 
of the same dimension in Pr„ + i(d). In the situation of Theorem 12.31 however, the 
incidence corresponence restricts to those ([f], [A]) such that A and A' are contained 
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in V(f), where A' is fixed. The fibers of q are no longer projective spaces of the 
same dimension, but rather their dimension depends on the intersection An A'. 

As in the proof of Corollary 12.21 we denote the subpace of Pr„ + i(d) consisting 
of [f] G PT n+ i(d) that vanish on A' by PS A '(d). For [A] G Gr(jfe,n) such that 
dim A n A' = k', k' G { — 1, ■ ■ ■ , k}, we can choose a basis for C n+1 such that A = 
P Span(eo, . . . , eu) and A' = P Span(efc_fc/ , . . . , e2k-k')- If we replace the original 
incidence correspondence with I k > = {([f], [A]) : A', A C V(f), dim A n A' = k'}, 
then the non-empty fibers of q over Gr(fc, n) are now projective spaces of a constant 
dimension, and the non-empty fibers of p over P S\> (d) are the following subvarieties 
ofF k (V(f)), 

F k ,k' = {[A] G F k (V(f)) : dim An A' = k'}. 

The natural description of the image of the second projection, q, is via Schubert 
cells. Given a complete flag of projective subspaces Wi CP", 

£ W C . . . C W n = P" 

such that A' = W k , and a non- increasing integral vector A = (Ao,...,Afc) with 
< Aj < n — k, the Schubert ce/fl £a C Gr(fe, n) is 

£ A = {[A] : dim A n W n - k+i - Xi > i for all i}. 

The codimension of £a in Gr(k, n) is Yli=o ( see 1GH78] for further details). 

For example, if n = 6, k = 1, and k' = 0, then Fi i o(V'(f)) is the intersection of 
Fi((f)) with the interior of £4,0; if instead k = 2, then F2 } o(V(f)) is obtained by 
intersecting with the interior of £2,0,0; and if k = 2 and k' = 1, then we intersect 
with the interior of £3,3,0 to obtain i*2,i(V(f)). In general, for — 1 < k' < k, 



F k , k ,(V(f))=F k (V(f))nZl 



2k+k' ,...,n-2k+k' ' 



(k' + l)-timcs 

where we use the common abbreviation of omitting the components of A equal 0. 
We now write the incidence correspondence of interest as 

I k ' = {([f],[A]) : A, A' C V(i), dimAn A' = fc'}CP5 A -(d) x £° 

PSA'(d)'" 

To prove Theorem 12.31 we determine the expected dimension of F kl k' for each 
k' G { — l,...,fc}, and then show that under the assumption S(n,d, k) < 0, all 
expected dimensions are negative except when kl = fc, i.e. when Fk y k' = [A']. 

The map q is surjective with fibers projective spaces of the same dimension, so 
Ik' is smooth and irreducible of codimension ( d ~^. k ) — ) in P5A'(d) x £°, and 

dim/fe k ' = (k+ l)(n - k) - (k' + l)(n - 2k + k') 



We use the less-common practice of denning Schubert cells via projective, rather than afBnc, 
dimension. 
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Subtracting dim. Ik' from dim P S\> (d) implies that the expected dimension of Fk,k' 



is 



(2.1) S(n,d,k,k') = (k-k')(n-k + k' + l)+ ( d ^, fc ') - ( d fc fc )- 

Here we take (_^) = for any r € N. 

To finish the proof, it suffices to show that all expected dimensions are negative 
(except when k' — fc, when the dimension is 0). We denote the first forward differ- 
ence of S(n, d, k, k') with respect to k' by A(fc') — S(n, d, k, k' + 1) — S(n, d, k, k'), 
and the second forward difference by A 2 (fc') = A(k' + 1) — A(fc'). Then 

'd + k' s 



A(fc') = -2k' -n + 2k 



k' + l 



Since we assume d ^ (2), A 2 (fc') is non-negative for k' G { — 1, ■ • ■ , k}, and hence 
d(n, d, fc, k') is convex for these k! . Note further that S(n, d, fc, —1) = 5(n, d, k), and 
<5(n, d, fc, fc) = 0. By assumption, <5(n, d, fc) < 0, so all expected dimensions except 
for k' = k are negative. 

It follows that the map pk> cannot be dominant for k' G {— 1, . . . , k — 1}, and so 
the variety Fk,k> will be empty for generic [f] G P5A'(d). Since Fk,k = [A'], this 
completes the proof. 

□ 

We conclude by studying the case S(n, d, k) — 0. Replacing Pr„ +1 (d) by S° 
for k = — 1 in the proof of Theorem 12.11 in |DM98j yields the following (for an 
expository account of this proof, see |Larl2j ). 

Corollary 2.4. Let A' G Gr(fc, n) be generic, and let [f] G Pr„ +1 (d) be generic 
such that A' C V(f). IfS(n,d,k) = ; then dimF k ,_i(^(f)) = 0. 

In particular, there exists in this case a fc-plane distinct from A' (in fact, disjoint 
from it) contained in i r j.(V(f)). Combining with Theorem 12.31 will give a precise 
characterization of identifiability in SSA. 

Corollary 2.5. Let A' G Gr(fc, n) be generic, and let [f] G Pr n+ i(d) be generic 
such that A' C V(f). Then F k (V(i)) = [A'] if and only if S(n, d, fc) < 0. 

2.2. Lower rank quadrics. We specialize now to intersections of quadrics, with 
all quadrics vanishing on a common fc-plane A', and all with rank r (meaning their 
Gram matrices all have rank r) . We begin with a result that makes no assumption 
about a common linear subspace. Since we only deal with quadrics, we set 

'k + 2^ 



8{n, s, fc) = (fc + l)(n — fc) — s 



2 



Proposition 2.6. Let q = (q^ , . . . , q^), be generic homogeneous quadratic forms 
of rank r. If r > 2k + 2 and 5(n, s, fc) > 0, then ^(^(q)) has dimension 5(n, s, fc). 

Proof. An immediate corollary of Theorem 2.1 of DM98 (Theorem 12.11 above) is 
that for a fc-plane Ao G Fk(V(q)), the Jacobian matrix defining the tangent space 
T[A ]Fk{V(q)) is full-rank. Choose coordinates such that A = Span(eo, . . . , e/c), 
and consider the coordinate patch Uq of the Grassmannian Gr(fc, n) centered at A 
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with coordinates {x a .b ■ < a < n, k + 1 < b < n}, and corresponding tangent space 
coordinates denoted by X a ^. For each i £ {1, . . . , s}, let q u %, where < u < v < n, 
be coordinates for the vector space of homogeneous quadratic polynomials (i.e. 
the entries of the quadratic form's Gram matrix lying on or above the diagonal — 
see Figure 12.21 for an example) . The defining equations for the tangent space of 
F fe (F(q)) at [A ] are 

(2-2) q% +1 X u , k+1 + ... + = 0, 

+ &+i x ^k+i + ■■■ + q$ n X u , n = 0, 
for all £ € {1, ... , s} and u, v G {0, . . . , k} with u < v. The first main ingredient 
of the proof is that these equations depend only on the coordinates <j4*,i satisfying 
0<u<k<v<n. 

The second ingredient is a particular choice of coordinates for quadrics of rank 
at most r, given by all coordinate functions q u % except for those appearing in the 
bottom right (n+1 — r) x + 1 — r) corner of the quadric's Gram matrix (see Figure 
I2.2[) . Note that, by our assumption on r, these excluded entries are never among 
the coordinate functions appearing in the defining equations of the tangent space 
(|2.2p ~ (|2.3p . For each i S {1, . . . , s}, these coordinates are valid on an open subset 
of r n+ i(2), which we denote by U r . We next show that U r n S\ is non-empty. 

To prove this claim, let be the Gram matrix of the quadratic form q^K By 
assumption all (r + 1) x (r + 1) minors of QW vanish. The resulting equations 
enable us to write the omitted coordinate functions as rational expressions in the 
remaining coordinate functions, and the locus in r„ + i(2) where the denominators 
of these rational expressions do not vanish is precisely the open set U r where these 
coordinates are valid. 

Specifically, fix u, v with r < u < v < n. Let Qui be the (r+1) x (r+1) submatrix 
obtained by deleting the rows indexed by {r, . . . , n} \ {u} and the columns indexed 
by {r, . . . , n} \ {v}. The corresponding minor is 

= detQ«= II itlk) 

aeSym({0,...,r-l,ti},{0,...,r-l,u}) fee{0,...,r-l,u} 
creSym({0,...,r-l},{0,...,r-l}) fce{0,...,r-l} 

+ e n ?2(«. 

o-eSym({0,...,r-l,«},{0 1 ...,r-l,ii}), fce{0,...,r-l,«} 

(i) 

which gives the promised rational expression for q u 'v, with denominator equal to 
the top- left r x r minor of so that U r is the complement of the vanishing locus 
of this minor. Now it is easy to see that U r D S\ is non-empty, since the quadric 

(i) . (i) 

with Gram matrix having entries q u ' v = 1 it u + v = r and q u ' v = otherwise lies 
in this intersection. 

To finish the proof, Theorem [2J] guarantees that for generic q = (q^\ . . . , q^ s '), 
the Fano scheme Fk(V(q)) is of the expected dimension, equivalently, the equations 
(|2.2p ~ (|2.3p are full-rank. By the observation following these equations, it therefore 
follows that for generic coordinate choices q u 'v, £ = 1, . . . , s with 0<u<k<v<n, 
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equations (|2.2p - (|2.3p are full-rank. Hence a generic intersection of quadrics of rank 
at most r that lies in the coordinate patch described above is also in the open 
set where the dimension of Fk(V(q)) matches the expected dimension. Since the 
variety parametrizing symmetric matrices of rank r is irreducible, the result follows. 

□ 

The proofs of Corollary 12.21 and Theorem 12.31 require only notational changes to 
extend the above arguments to conditional genericity, where the additional condi- 
tion is containment of a fixed fc-plane A'. 

Corollary 2.7. Let A' G Gr(fc, n) be generic, and letq = (qW,...,q^) be generic 
homogeneous quadratic forms of rank r such that A' C V(q) and r > 2k + 2. 

(1) IfS(n,s,k) > 0, then dimF k {V(q)) = S(n,s,k). 

(2) IfS(n,s,k) < 0, thenF k (V(q)) = [A']. 

A direct extension of Corollary 12.51 now leads to a precise determination of an 
idcntifiability criterion for SSA. 

Corollary 2.8. Under the assumptions of Corollary \2.7[ Fk(V(q) = [A'] if and 
only if 5{n, s, k) < 0. 

It would be interesting to extend these lower-rank results to other multidegrees. 
For our application to machine learning, this generalization would eliminate the 
need to assume that only knowledge of the first two cumulants is available (see 
Section |3]). There are however two significant obstacles to extending the techniques 
used here. First, it is not known in general if the variety parametrizing lower rank 
symmetric tensors of degree > 3 is irreducible. And second, coordinates like the 
ones used for quadrics are not known in general. 

3. Application to stationary subspace analysis 

In this section, we show how the preceding results can be applied in machine 
learning. A central task in multivariate time series analysis is to separate out data 
coming from different sources, e.g. filtering out noisy data, or identifying a time- 
stationary data source from time- varying sources. A recent method for this second 
task is stationary subspace analysis (SSA) [vBMKM09 . 

Let {Xq 1 ^, . . . , Xq 1 , . . . , Xs , . . . , Xg'} be time series data separated into 
s + 1 consecutive epochs, with Xf £ C™ +1 for all Each epoch is modeled as a 
random variable JQ. A central assumption of SSA is that these data come from a 
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linear superposition of a k + 1 dimensional stationary signal, and ann-t dimen- 
sional non-stationary signal. The task of identifying the stationary signal is now 
equivalent to finding a projection matrix P £ C^ +1 ^ x ^ n+1 - ) such that PXq, . . . , PX S 
are identically distributed, i.e. 

(3.1) PX Q ~ PX\ PX S . 

To make this problem well-defined, we must further assume that the data X\- 
and hence the random variables Xi — are general under the assumption that such 
a projection P exists. An important problem in SSA is to determine the minimal 
number N of epochs required to uniquely determine P. 

The connection to algebraic geometry and Fano schemes arises through cumu- 
lants. Let X be a random variable taking values in C™ +1 , then the dth cumulant of 
X is a symmetric tensor of degree d. For example, the first cumulant is the mean 
vector of X, and the second cumulant is the covariance matrix of X. In general, 
the e?th cumulant of X can be represented as a homogeneous polynomial in n + 1 
variables of degree d (see e.g. Appendix 2 of |Eis95j. For z = [zq, . . . , z n ), we 
define the dth cumulant polynomial Kx,d{z) of X by the generating function 

oo 

gx(z) = logE(exp(z • Xj) = ^K X ,d{ z )- 

d=0 

If the cumulant generating functions of two random variables have finite radii of 
convergence, then by taking Fourier transorms it can be shown that these two 
random variables are identically distributed if and only if their cumulants coincide, 
just as with random variables taking values in C. Hence the defining property of 
P from Equation (|3.ip translates into an a priori infinite system of homogeneous 
algebraic equations, 

(3.2) P o K i:d (z) = P o K jt d(z), 

for all d > and all < i < j < s, where we have abbreviated KXi,d( z ) by 
Ki t d(z)i and the action of P on Ki,d{z) is induced from its action on C n+1 . We 



refer to Section 2 of KvB M + 12 for more details. To avoid an infinite number of 
equations, in SSA it is assumed that the stationary and non-stationary signals can 
be separated from one another by considering only the first two cumulants. 

To connect Equations (|3.2p to Fano schemes, let A' C C™ +1 be the row-span of 
the matrix P. For 1 < i < s, set f^' — K^d — «o,d G r(°0- Then Equations (|3.2p 
are satisfied if and only if all polynomials vanish identically on A', which is 
equivalent to the condition [A'] e Fk(V{f {1 \ f^)). In case d = 1, the resulting 
equations are linear, so by the genericity assumption on the X,, these equations only 
reduce the ambient dimension. Hence Fano schemes of intersections of quadrics is 
the main case of interest for SSA. Since the number of epochs tends to be large, 
the case of two epochs — i.e. a single quadric — can be reasonably omitted. 

The specific application of our results on Fano schemes to SSA is to give a 
precise characterization of the number of epochs required to uniquely identify the 
projection P. An upper bound for the identifiability of the SSA problem was proven 



in the Appendix of jKvBM+12 



Theorem 3.1. Let Ao, . . . , X s be generic random variables in C n+1 conditional 
upon the existence of a projection P £ c( fe + 1 ) x (™+ 1 ) such that the first two cumu- 
lants of all PXi coincide. Then P is uniquely identifiable if s > (n + 2)/k. 
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The results from Section [2] sharpen the above result and completely solve the SSA 
idcntifiability problem for the case of the first two cumulants: 

Theorem 3.2. Let X$, . . . , X s be generic random variables in C n+1 conditional 
upon the existence of a projection P G C( k+1 ) x ( n+i ) such that the first two cumu- 
lants of all PXi coincide. Then P is uniquely identifiable if s > 2(n — k)/(k + 1) , 
and this bound is sharp. 

In some SSA problems, the differences of covariance matrices of the random vari- 
ables Xi arc rank deficient. As long as the rank is not too low, Corollary 
provides a generalization of Theorem 13.21 for these situations. 
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